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Abstract. Reflecting boundary conditions cause two one-dimensional random 
walks to synchronize if a common direction is chosen in each step. The mean 
synchronization time and its standard deviation are calculated analytically. Both 
quantities are found to increase proportional to the square of the system size. 
Additionally, the probability of synchronization in a given step is analyzed, which 
converges to a geometric distribution for long synchronization times. From this 
asymptotic behavior the number of steps required to synchronize an ensemble 
of independent random walk pairs is deduced. Here the synchronization time 
increases with the logarithm of the ensemble size. The results of this model are 
compared to those observed in neural synchronization. 



PACS numbers: 05.45.Xt, 05.40.Fb, 84.35. +i 



1. Introduction 

Synchronization of neural networks by mutual learning [1,2] has recently been applied 
to cryptography [3]. Two neural networks which are trained by their mutual output 
bits can synchronize to a state with common time dependent couplings [4]. It has 
been shown how this phenomenon can be used to generate a secret key over a public 
channel [5,6]. 

The networks used for neural cryptography are tree parity machines [7] consisting 
of K hidden units, which are discrete perceptrons with independent receptive fields 
made up of N binary input neurons. The influence of each input value on the output 
of the neural network is determined by the corresponding weight, which is an integer 
number in the range between — L and +L. Since the synaptic depth of the neural 
networks is limited by the parameter L, there are only m — 2L+ 1 possible values for 
each weight. 

In each step a set of common inputs is chosen randomly for all tree parity machines 
involved in the process of synchronization. Then the weights are updated according 
to a suitable learning rule [8]. Thereby the change of each weight can be either +1, 
or — 1 depending on the corresponding input value and the calculated output of the 
neural networks. If an update is not possible because of the limited synaptic depth, 
then the affected weight is left unchanged in this step. That is why the process of 
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Figure 1. Random walks with reflecting boundaries. 



synchronization is identical to an ensemble of random walks with reflecting boundaries, 
driven by pairwise identical random signals and controlled by mutual output bits [4,6]. 

Therefore, to better understand neural cryptography, it is the aim of this paper to 
investigate the process of synchronization of random walks in some detail. We neglect 
the effect of the control bits, which is essential for cryptography [9], but we derive 
exact solutions for the synchronization times. In addition, since synchronization is 
an active field of research (for an overview see [10]), synchronization of random walks 
may be of interest in other contexts, too. 

In this paper we analyze the model shown in figure ^ Two random walks can 
move on a one-dimensional line with m sites. In each step a direction, either left or 
right, is chosen randomly. Then both random walkers move in this direction. If one 
random walk hits the boundary, it is reflected, i.e. the random walk remains at its site 
on the left or on the right. As the other random walker is not affected, the distance d 
between both decreases by 1 at each reflection. Otherwise d remains constant. 

The important quantity in this model is the synchronization time T, which is 
defined as the number of steps until d reaches zero. In sectionElwe calculate the mean 
value and the standard deviation of this random variable analytically. Because large 
fluctuations of the synchronization time are observed in simulations, the average value 
of T does not describe the behavior of the model sufficiently. Therefore, we estimate 
the probability distribution of the synchronization time in section |3 Using this result 
we are able to calculate the number of steps T/v, after which an ensemble of N random 
walk pairs reaches a synchronized state. This quantity is equal to the maximum of 
T observed in N independent samples. Therefore, we use methods of extreme order 
statistics to calculate the distribution of T/v, which is shown in section 0] 



2. Average synchronization time 

In order to calculate the mean value (T) of the synchronization time, the synchroniza- 
tion process is divided into independent parts, each of them with constant distance d. 
We first compute the average number (Sd }Z ) of steps until a reflection occurs, which 
decreases d by 1. Of course, this value depends on the initial position of the two 
random walkers. We use the distance d between both walkers and the position z of 
the left walker to describe these initial conditions. 

If the first move is to the right, a reflection only occurs in the case of z = m — d. 
Otherwise the synchronization process continues as if the initial position had been 
z + 1. Under this condition, the average number of steps with distance d is given by 
(Sd,z+i + 1)- Similarly, if the two random walkers move to the left in the first step, 
this quantity is equal to (Sd, z -i + 1)- Averaging over both possibilities we obtain the 
following difference equation: 

(S d , z ) = ~(S d , z ^) + ~(S d , z+1 ) + l. (1) 
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Reflections are only possible if the current position z is either 1 oim-i In both 
situations d changes with probability | in the next step. In order to take this into 
account we have to use 

Sd,o = and Sd, m -d+i = (2) 

as boundary conditions. 

Equation is identical to the classical ruin problem [11]. Its solution is: 

(S d ,z) = (m-d+l)z-z 2 . (3) 

In a similar manner we obtain a difference equation for (S^ z ), which is used later 
to calculate the standard deviation of the synchronization time: 

(Si,) = l((Sd, z -i + I) 2 ) + l((S d , z+ i + l) 2 ) . (4) 

From an d JH we obtain the following relation for the variance of Sd,z'- 

iq2 \ iq \2 ( S d,z-i) - (S d , z -l) (S d , z +i) ~ ( s d,z+i) 2 „ ,2 

(Sd, z ) ~ (Sd,*) = g + 2 + {m-d+l-2z) .(5) 

Applying a Z-transformation yields the solution 

(SL) - (5, 2 ) 2 = (w - d + 1 7 )2 + ^ 2 - 2 (S d , z ) . (6) 

With these results we can calculate the average value of the synchronization time 
Td. z for two random walks starting at distance d and position z. The first reflection 
occurs after Sd. z steps. Then one of the random walkers is located at the boundary. 
As our model is symmetric, both possibilities z = lorz = m- d are equal. Hence the 
second reflection takes place after S d , z + S d -i,i steps. So the total synchronization 
time is given by 

d-l 

Td.z — Sd.z + ^ Sj.i . (7) 

j'=i 

Using we obtain 

(Td.z) = (m-d+l)z-z 2 + -(d - l)(2m - d) (8) 

for the expectation value of this random variable. The variance of Td. z can be 
calculated in a similar manner, because the parts of the synchronization process are 
mutually independent. 

Finally, we have to average over all possible initial conditions in order to calculate 
the mean value and the standard deviation of the synchronization time T for randomly 
chosen starting positions of the two random walkers: 

(t) 4eew=^ ! t )! + ! ^. ^ 

m z *■ — ' * — ' o Am 

d=l z=l 

p-, . ^ f "f ps.) . 17 "' 5 - 5 "" 4 + 65 "- J - 45 '" 3 + 8 - +6 . no, 

m ^— ' ' 90m 

d=l z = l 

So the average number of steps required to reach a synchronized state, which is 
shown in figure [2 increases nearly proportional to m 2 . In particular for large system 
sizes m, we obtain the asymptotic relation 

(T) - \m 2 ~ \L 2 . (11) 
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Figure 2. Synchronization time of two random walks as a function of the system 
size m. Error bars denote the standard deviation observed in 1000 simulations. 
The analytical solution from J^J is plotted as dashed curve. 



This result is consistent with the scaling behavior (T) oc L 2 observed in neural 
synchronization [5]. 

Numerical simulations, both for random walks and neural networks, show large 
fluctuations of the synchronization time. The reason for this observation is that not 
only the mean value, but also the standard deviation 



/7m 6 — 11m 5 — 15m 4 + 55m 3 — 72m 2 + 46m — 10 

°* = V 90^ (12) 

of the synchronization time increases with the extension m of the random walks. Here 
we find that o~t is asymptotically proportional to (T) : 

<7T~yj(T). (13) 

Therefore, the relative fluctuations ar/iT) are nearly independent of m and not 
negligible. 

3. Probability distribution 

The synchronization of two random walks is a Markov chain, because the next step 
only depends on the current state of the system. We can, therefore, derive an equation 
for the distribution of the total synchronization time T. 

For this purpose we introduce hm(m + 1) variables Pd,z(t), which are defined as 
the probability to find the two random walkers at the positions z and z + d in time 
step t. In the initial ensemble the positions are chosen randomly, hence all variables 
are set to 

PdA0) = l^. (14) 
m^ 

The development of the probabilities in each step is given by the following equations 
for < d < m — 1 and 1 < z < m — d: 

Pd,z{t+l) = -Pd,z-i{t) + ^Pd,z+l{t), (15) 
Pd,i(t + l) = -Pd,2{t) + ^pd + i,i(t) + ^6 d ,opo,i, (16) 
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Figure 3. Probability of synchronization P(T = t) as a function of the number 
of steps. The symbols show the result of the numerical calculation for m = 3 (O) 
and m = 5 (□). 



Pd,m-d{t+ 1) = -zPd,m-d-\{t) + -^Pd+l,m-d-l(t) + -^S d fiP0,m , (17) 
p m -l,l(t + l) =0. (18) 

The probability P(T = t), that the two random walks synchronize in the time 
step t > 0, is given by 

m 

P(T = t) = (Po,,(*) ~ Po,.(* - 1)) ■ (19) 

2 = 1 

Using H15(l - (|18[) we deduce the expression 

P(T = = - 1) + ~Pi,m-i(* - 1) • (20) 

These equations can easily be iterated numerically. 

Figure |3 shows the results of these calculations for m — 3 and m = 5. Because 
of the randomly chosen initial conditions there is a probability of P(T = 0) = 1/m 
that the two random walkers even start synchronized. Furthermore, one notices that 
P(T = t) is constant and equal to 2/m 2 in the range < t < m. 

In principle, it is also possible to calculate the probability distribution P(T = t) 
analytically. We use this method to estimate an approximation of P(T — t), which 
is asymptotically exact for very long synchronization times t 3> m. For that purpose 
we start with a result known from the solution of the classical ruin problem [11]: The 
probability that a fair game ends with the ruin of one player in time step t is given by 




In our model a — 1 = m — d denotes the number of possible positions for two random 
walkers with distance d. And u(t) is the probability distribution of the random variable 
S d , z , which we introduced in section As before, z is the initial position of the left 
random walker. 

According to J7J) the synchronization time Td, z for fixed initial conditions is the 
sum over Sij for each distance i from d to 1. Therefore, its probability distribution 
P(2d,2 = t) is a convolution of d functions u{t) defined in (|2ip. The convolution of 
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two different geometric sequences b n = b n and c n = c™ is itself a linear combination 
of these sequences: 



n-1 

£ 

3=1 



(22) 



Therefore, P(?d,z = t) can be written as a sum over geometric sequences, too: 

fc-7r\ 



a 



(23) 



a=m—d J r 1 fc=l 

In order to obtain P(T = i) for random initial conditions, we have to average over 
all possible starting positions of the random walkers: 

i— 1 m — d 

(24) 

d=l z=l 



So we can even write 

m a—1 



cos 



a 



t-i 



(25) 



a=2 k=l 

as a sum over many geometric sequences. 

For long times only the terms with the largest absolute value of the coefficient 
cos(fc7r/a) are relevant, because the others decline exponentially faster and can be 
neglected for t — > oo. Hence the asymptotic behavior of the probability distribution 
is given by 

t-i 



P(T = t) ~ [q m<1 + {-iy- l q m , m -i] 



(- 

Vto 



(26) 



The two coefficients q m ,i an d Qm,m-i m this equation can be calculated using l|22|) . 
This leads to the following result: 



Qm, 1 



sin 2 (7r/rn) 



2 d +\m-d)\ 



m 2 m\ 



1 — cos(7r/m) 
j-| v > sin 2 (fc7r/2) 



d=l 

rn — 1 a—1 

y . 

, f— ' cos(7r/m) - cos(A:7r/a) 1 - f> a .m-d+i cos(fc7r/a) 

a—m—d+l k—1 



sin 2 (fc7r/a) 



(27) 



Qm,m — 1 



sin 2 (7r/m) cos 2 (m7r/2) 



m — 1 a — 1 



f-f 1 + cos(tt 

d=l ' v 



(ir/m) 



n 



sin 2 (A:7r/2) 



sin 2 (fe7r/a) 



^— ' cos(7t/to) + cos(kn/a) 1 — <L 

a— m— fc— 1 



m— d+1 



cos(fc7r/a) 



(28) 



which is also shown in figure 

For odd values of to, the coefficient q m , m -i = 0. In this case P(T = t) asymp- 
totically converges to a geometric probability distribution for long synchronization 
times: 

t-i 



P(T = t) ~ q m ,i [cos (-) 
L Vto/ 



(29) 



If to is even, we have to consider oscillations of P(t) around the function given by 
<|29[1 . because q m ,m-i 0- But for to > 8, the coefficient q m ,m-\i which determines 
the amplitude of these oscillations, is smaller than 10 _3 g mj i. As shown in figure 
H29fl is a good approximation of the asymptotic behavior for even values of to, too. 
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Figure 4. Value of the coefficients g m ,l (O) an d 9m,m-l (d) as a function 
of m. The parameter g m ,i can be approximately calculated using the function 
9m, l ~ 0.324m[l — cos(7r/m)], which is shown as dashed curve. 
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Figure 5. Probability distribution P(T = t) of the synchronization time for 
m = 10. The numerical result is plotted as full curve. The dashed line denotes 
the asymptotic function defined in 1291 . 



4. Extreme order statistics 

In this section we extend our model to ensembles of random walks. We consider N 
independent pairs of random walkers driven pairwise by identical random noise. Each 
pair is finally synchronized by the effect of the reflecting boundary conditions. From 
the results discussed above we know the expected average synchronization time. But 
the important quantity in this case is the number of steps TV until all random walks 
are synchronized. Of course, this random variable is equal to the maximum value of 
T observed in N independent samples. 

From the distribution function P(T < t) we can easily deduce the probability 
distribution of TV: 

P(Ttv < t) = P(T < t) N . (30) 

Therefore, we can calculate the average value (TV) using the numerically 
computed distribution. The result, which is shown in figure El indicates that (TV) 
increases proportional to In TV: 



(TAr)-(T)cxln7V. 



(31) 



Synchronization of random walks with reflecting boundaries 



200 r 



150- 



1 100- 



50 



10" 



10' 



io- 
N 



3 4 

10" 10 



Figure 6. Average synchronization time (TV) as a function of N for m = 7. 
Results of the numerical calculation using l30l are represented by circles. The 
dashed line shows the expectation value of TV calculated in 1351 . 



Similar behavior has been observed in neural synchronization [4,6]. 

For large N only the asymptotic behavior of P(T < i) is relevant for the 
distribution of Tjv- The exponential decay of P(T = t) in (|26|) yields a Gumbel 
distribution for P(T N < t) [12], 



G(x) = cxp I— e~$~ 

for N 3> m with the parameters 

Nq m ,i 



a = (3 In ■ 



and (3 = — 



1 



1 — cos(7t/to) 
Substituting into (jo^ we get 

Nq m ,i cos'(7r/m 



lncos(7r/TO.) 



P(T W < t) = cxp - 



(32) 



(33) 



(34) 



1 — cos(7t/to) 

as the distribution function for the total synchronization time of N pairs of random 
walks (N ^S> to). The expectation value of this probability distribution is given by [12] 



(T N ) = a + fa = 



1 



7 + In N + In 



9m, 1 



In cos(7t/to) 

In this equation, 7 denotes the Euler-Mascheroni constant, 
obtain the relation 

2 0/ , „ , 2to 2 



.(35) 



1 — COs(7t/to) 

For N » to » 1, we 



(Tjv) - ^to 2 7 + lnAr + ln 



9m, 1 



(36) 



which shows that (T/v) increases asymptotically proportional to to 2 In N. Using the ap- 
proximation for g m ,i shown in figure^! we find (T/v) ~ (2/7t 2 )to 2 (In N + ln(0.577m)). 

Neural cryptography is somewhat more complex than our model with random 
walks. Since in this case the movements of the random walks are controlled by 
learning rules [8], there are also steps without changes of the weights. These are not 
included in our calculation of the synchronization time. Additionally, repulsive steps 
destroying synchronization [7] are possible, too. Nevertheless, the synchronization 
time of neural synchronization scales like (T/v) oc L 2 \nN [5], in agreement with l|36|l . 
And its probability distribution, obtained from numerical simulations and shown in 
figure [7| is described well by a Gumbel distribution where the two parameters a and 
f3 are fitted to the data. 
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Figure 7. Probability distribution of the synchronization time for two tree parity 
machines with the parameters K = 3, L = 3 and N = 1000. The histogram shows 
the relative frequency of occurrence observed in 10 000 simulations. The thick 
curve represents a Gumbel distribution for a = 355.8 and (i = 84.5. 



5. Conclusion 

We have analyzed the synchronization of two random walks with reflecting boundary 
conditions. We calculated the mean value and the standard deviation of the 
synchronization time T analytically for randomly chosen initial positions of the two 
random walkers. The average number of steps until synchronization increases with 
m 2 , the square of the system size. Additionally, the standard deviation ut also scales 
with to 2 , which shows that fluctuations of T cannot be neglected even if m is large. 

The probability P(T = t) that two random walks synchronize at t steps has been 
derived. For long synchronization times t ^> m the asymptotic behavior of P(T = t) 
is given by a geometric probability distribution with parameter p = 1 — cos(7r/m). 

We have also studied the number of steps T N needed to synchronize N 
independent pairs of random walks. The average value of this random variable 
increases with In AT. And, for large values of N, the probability P(T/v < t) that t 
steps are sufficient for synchronization is given by a Gumbel distribution. 

Finally, our model is able to reproduce the scaling behavior observed in neural 
cryptography. We find that Tn increases nearly proportional to L 2 IniV. Additionally, 
even for tree parity machines, the distribution of the synchronization times of neural 
cryptography is described by a Gumbel distribution. 
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